Stable Husbands 



Donald E. Knuth, Rajeev Motwani, and Boris Pittel 
Computer Science Department, Stanford University 

Abstract. Suppose n boys and n girls rank each other at random. We show 
that any particular girl has at least ( | — e) In n and at most ( 1 + e) In n different 
husbands in the set of all Gale/Shapley stable matchings defined by these rank- 
ings, with probability approaching 1 as n oo, if e is any positive constant. The 
proof emphasizes general methods that appear to be useful for the analysis of 
many other combinatorial algorithms. 

1. Introduction. This is a tale of n girls and n boys who play a game called "stable 
matching," invented by Gale and Shapley [3]. Each player ranks each player of the opposite 
sex according to preference; thus, there are n permutations of the set of boys, representing 
the preferences of the individual girls, and there are n permutations of the set of girls, 
representing the preferences of the individual boys. The object of the game is for the boys 
and girls to match up so as to obtain n marriages that are stable, in the sense that no girl 
and boy prefer each other to their current partners. 

For example, suppose Alice, Brigitte, Cindy, and Debra play with Wilfred, Xavier, 
Yuri, and Zeke; and suppose their preference rankings are as follows, from favorite to least 
desired: 

Alice likes Y > X > Z >W Wilfred likes A> B > D > C 

Brigitte likes X > W > Y > Z Xavier likes C > A> D > B 

Cindy likes W >Y > X > Z Yuri likes B > D > A> C 

Debra likes X > W > Z > Y Zeke likes B > A> C > D 

The matching {AW, BX, CY, DZ) is unstable, because for example A prefers Z to W and 
at the same time Z prefers A to D. But the matching [AZ, BW, CX, DY) is stable; most 
of the players are matched with a person other than their first choice, but the objects of 
their affections don't want to change. The given preferences also admit another stable 
matching, namely [AY, BW, CX, DZ). In this example only two matchings are stable. 

The stable husbands of a girl are the boys she can be married to in at least one stable 
matching. Thus, Alice's stable husbands in the example are Yuri and Zeke. Brigitte has 
only one stable husband, namely Wilfred; she likes Xavier better, but he can't stand her. 
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2. An algorithm. Gale and Shapley [3] gave a procedure to find a stable matching, 
given any set of preferences; McVitie and Wilson [10] extended the method so that all 
stable matchings would be found. More recently, Gusfield [5] exploited the interesting 
lattice structure of stable matchings to construct an elegant algorithm that simultaneously 
determines the stable husbands of all girls in O(n^) steps. For our purposes in the present 
paper it suffices to consider a simplified variant of these procedures, which finds the stable 
husbands of just one given girl G. 

The basic idea is to maintain partial matchings in which each boy who currently has 
a partner is paired with his best possible choice among all stable matchings of a certain 
class. One of the boys who doesn't have a current partner is temporarily called P; he will 
propose to one of the girls, and she will then decide whether to accept or to reject his 
proposal (at least for the time being). The role of P passes from boy to boy according to 
the following simple rules: 

AO. Initially all boys and girls are unpaired. 

Al. If at least one boy has no current partner, let P be one such boy. Otherwise all boys 
and girls are already paired, and we have a stable matching; output G's partner S 
as one of her stable husbands, then remove the pair GS from the current matching 
and let P = S. (Henceforth we will consider only stable matchings in which G is 
not married to S.) 

A2. If P has already proposed to all the girls, terminate the algorithm. Otherwise let H 
be the girl P likes best among all those he hasn't approached so far; P now proposes 
to H. 

A3. If girl H has already been proposed to by a boy she prefers to P, she rejects 
P's proposal. Otherwise, she accepts P, and they become paired in the current 
matching; her previous partner (if any) now assumes the role of P. If she had 
no previous partner, the algorithm continues at step Al, otherwise it continues 
at A2. © 
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For example, suppose we run this algorithm on the preference rankings given in the 
introduction, always choosing the alphabetically least boy when there is a choice in step Al. 
Let the special girl G be Alice. Then the following events occur: 
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terminate. 



Each girl is paired with the boy who has made her the best offer so far, except that 
the special girl Alice never has a partner in step A2 after the first stable matching has 
been found. The notation '(AZ)' in this chart means that Alice has no current partner, 
but that her best proposal so far has come from Zeke. In one place where ^{AY)' appears, 
A rejects X even though she is currently unattached, because she prefers y to X and she 
will not lower her previous standards. 

To prove that this algorithm finds all stable husbands of G, let us suppose for con- 
venience that exactly one proposal is made per unit of time, so that the tth execution of 
step A2 takes place at time t. Denote by Mf the set of all stable matchings such that G is 
not married to any of the S"s already output before time t. Then Mq is the set of all stable 
matchings, and t < t' implies that Mt D Mf. The correctness of the algorithm relies on 
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the following crucial fact: 



If girl H rejects a suitor R between time t and t + 1, 

(*) 

then the pair HR is not part of any stable matching in M^+i. 

The proof is by induction on t. If (*) fails for the first time at t, suppose H rejects R in 
step A3 because she prefers Q. (Either Q = P and R is her previous best partner, or R = P 
and Q is her previous best.) Then we are assuming that HR is part of a stable matching 
in Mt+i, and in this matching the stability condition tells us that boy Q must be paired 
with some girl J he prefers to H. But Quentin must then have proposed to Jane before 
he proposed to Helen, so he must have been rejected by J at some time t' < t. Therefore, 
by (*) and induction, JQ is not part of a stable matching from M^i , contradicting the fact 
that Mf 5 Mt_|_i. Rejection of Rhy H must therefore have occurred not in step A3 but 
in step Al; in other words, we must have H = G and R — S, a, stable husband. But then 
Mt^i does not include GS, by definition. Therefore (*) must be true. 

The matchings found in step Al must be stable. For if some girl H prefers boy U to 
her current partner, she has not yet been proposed to by U; hence he prefers his current 
mate. In fact, (*) tells us that the stable matchings found in Al are characterized by the 
property that each boy has his best choice among all matchings in Mf. Thus, when the 
algorithm outputs S, each boy has his best choice among all stable matchings such that G 
is paired with S. 

The algorithm terminates when some boy has been rejected by all the girls. According 
to (*), this happens at time t when = 0, i.e., when all the stable husbands S of G have 
been output. 
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3. A random model. We wish to show that the algorithm just stated will produce at 
least clnn outputs with probability approaching 1, if c is any given constant < |, assuming 
that the n!^" possible preference sequences are selected uniformly at random. 

The basic idea will be to use the principle of late binding, which also has been called the 
principle of "conservation of ignorance" or "deferred decisions" — le principe d'ajournement 
des decisions in [7]. Instead of fixing the preference sequences in advance, we simply let 
them unfold to whatever extent the algorithm needs them as it runs. Thus, whenever a boy 
is asked to propose, he proposes to a random girl chosen uniformly from among the girls he 
hasn't tried yet. Whenever a girl receives her kth proposal, she accepts it with probability 
1/k. A stochastic process with these characteristics is equivalent to the original algorithm 
running on random preference sequences, because it has the same transition probabilities 
between states. 

We can also simplify the algorithm further by assuming that each proposal is uniformly 
random, as if each boy has "amnesia" [7] and cannot remember any of the girls he has 
previously asked. If it turns out that he has just repeated himself, we will say that he has 
just made a redundant proposal; such proposals are always rejected. The algorithm now 
reduces to a fairly simple stochastic process, which uses the following data structures: 

Ai, . . . , An = sets representing the girls proposed to so far by boys 1 to n. 

I = number of boys who have played the role of proposer. 

p = the boy who is currently proposing. 

h = the girl who is currently being proposed to. 

xi, . . . ,Xn = boys who made the best offer so far to girls 1 to n, or zero if the girl has 

received no offer. 
ki, . . . , kn = number of proposals received by girls 1 to n. 

BO. Let Aj — 0, Xj — 0, and kj — 0, for 1 < j < n; also let I = 0. 

Bl. If I < n, increase I by 1 and let p = I. Otherwise output Xg, where g is the number 
of the special girl G; and let p = Xg. 

B2. Let /i be a random number, uniformly chosen between 1 and n. (We say that boy p 
has proposed to girl h.) If h e Ap (i.e., if p's proposal is redundant), repeat this 
step. Otherwise replace Ap by Ap U {h} and go on to step B3. 

B3. Increase kh by one. With probability 1 — 1/kh, return to B2 (we say that girl h 
rejects the proposal). Otherwise interchange p ^ Xh (she accepts the proposal and 
her former partner will have to propose to somebody else). If the new value of p is 
zero, OT if h = g and at least one output has already occurred, go back to step Bl; 
otherwise continue with step B2. © 

Algorithm B faithfully models the previous Algorithm A on random input, except for 
the redundant proposals. Notice that the new algorithm never terminates; step A2 stops 
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when P has nobody left to propose to, but step B2 keeps making redundant proposals 
ad infinitum when Ap = {1, . . . ,n}. The details of Algorithm B aren't extremely simple, 
but we will see that certain aspects of its probable behavior are fairly easy to analyze, in 
part because it never terminates. 

4. Probabilistic preliminaries. Algorithm B can be regarded as a branching process, 
an infinite tree with nodes at levels t — 1,2,3, . . . corresponding to the tth time step B2 is 
performed. Every node a in this tree corresponds to a unique path from the root, repre- 
senting one of the possible behaviors of the algorithm up to time t. This path determines 
the values of the data structures {Ai, . . . , An, I, pi, h, xi, . . . , x^,, ki, . . . , kn) at node a. 

Every node a has 2n children a", cti, . . . , ct^, ct^, where ck^ and represent the nodes 
following a proposal that has been accepted or rejected by h. U h E Ap, the transition 
probability from a to is and the transition probability from a to is 1/n; this case 
corresponds to a redundant proposal, which is always rejected. li h ^ Ap, the transition 
probability from a to ck^ is l/{kh, + l)n and the transition probability from a to is 
kh/{kh + where kh is the data value that becomes fc/^ + 1 in step B3. 

The probability Pr(Q;) of node a is the product of the transition probabilities on the 
path from the root to a; this is the probability that Algorithm B will take the compu- 
tational path represented by a. Since the transition probabilities from each node to its 
children sum to 1, the sum of Pr(Q;) for all nodes a on a given level t is 1. 

We say that an event occurs at node a with local probability p if p is the conditional 
probability of the event given that the algorithm reaches a. Thus, for example, the local 
probability that a proposal is accepted at a is 



the sum of the transition probabilities in which the event occurs. 

An event at a that depends only on the transition probabilities from a to its children 
will be called an immediate event. More general events may involve a sequence of node 
transitions in the subtree below node a; all such events have local probabilities at a as 
defined above. For example, we might speak of the local probability that at most five 
consecutive rejections immediately follow node a. Local probabilities at a are equivalent 
to unconditional probabilities in the branching process represented by the subtree whose 
root is a. 

Our proofs will often be based on a technique of probability estimation that can 
conveniently be called the principle of negligible perturbation. The idea will be to change 
the transition probabilities between certain nodes, obtaining a "perturbed" probability 
distribution Pr' on which it is relatively easy to compute the probability of some given 
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event. Let t be a level of the tree, and let E be the set of all nodes at level t such that 
the given event is true. Let C be the set of all nodes a at level t whose probability has 
been perturbed somewhere along the path from the root to a; thus, Pr(Q;) = Pi'{a) for all 
a ^ C. Summing over all o; ^ C and taking complements tells us that Pr(C) = Pr'(C). If 
Pr(C) is small, then the perturbation will have a negligible effect on the probability of E, 
because 



\Pt{E) -Pt'{E)\ 



^ Pr(a) - J2 Pr'(«) 



(Pr(a) -Pr'(a)) 

a£EnC 



< ^ |Pr(a) -Pr'(a) 
aec 



+ 



aec 



Pr'(a)| = 2Pr(C) 



Expected values can be estimated in a similar way. 

(The principle of negligible perturbation seems almost absurdly simple, but we will 
see that it simplifies our analyses in surprisingly nontrivial ways. The idea is similar in 
spirit to Laplace's method [9] of asymptotic analysis, where integrals are estimated by 
changing the integrand in unimportant portions of the domain. Another kindred method 
is Wilkinson's well-known technique of "backward error analysis" [13], in which numerical 
errors are conveniently studied by assuming that exact answers have been obtained from 
approximate data; the actual situation, in which approximate answers are calculated from 
exact data, is more difficult to handle directly.) 

Many of the proofs below are based on estimates of the tails of probability distribu- 
tions, using the following fundamental inequalities that we shall call the tail inequalities: 
Let 

P{z) =PQ+ PlZ + P2Z^ + • • • = E{z^) 

be the probability generating function (pgf ) for a random variable X that takes nonnegative 
integer values. Then 

Pr(X < r) < x~''P{x) for < a; < 1 ; 
Pr(X > r) < x-^'Pix) for a; > 1 . 

The proof is easy, since we have pk < x~'^pkX^ when < a; < 1 and k < r, and also when 
a; > 1 and k > r. In spite of this easy proof, the tail inequalities lead to quite effective 
bounds because we can often choose x to make x~'^P{x) small. 

(The history of these elementary inequalities takes us back to the early days of 
probability theory. Bienayme [1] and Chebyshev [12] observed that Pr((X — fj,)'^ > 
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r) < -^((X — //)^)/r for all r > 0. Kolmogorov [8] went further and remarked that 
Pr(X > r) < E(^f{X))/s for any nonnegative function f{X), provided that ex- 
ists and f{x) > s > for all X > r. In particular [8, equation 4.3.2], we get the second tail 
inequality when f{x) = e^^ and c > 0. Chernoff [2] pointed out the wide applicability of 
such estimates.) 

5. Probabilistic lemmas. Consider the behavior of Algorithm B as n — > oo. We 
will say that an event occurs almost surely, or 'a.s.', if the probability that it doesn't 
happen is o(l), i.e., if the probability of nonoccurrence approaches zero as n — > oo. We 
will also say that an event occurs quite surely, or 'q.s.', if the probability that it doesn't 
happen is superpolynomially small, i.e., 0{n~^) for all fixed K. If p(n) is any polynomial 
function, the sum of 0{p{n)) superpolynomially small probabilities is superpolynomially 
small; hence if m = 0{p{n)) and if the events Ei, . . . , individually happen q.s., the 
combined event 'Ei and . . . and E^ also happens q.s. 

Let N = [n^+^J, where < 5 < ^ is a constant. Throughout this section we shall 
consider only the first proposals made by Algorithm B. Thus, probabilities of events are 
measured by summing Pr(Q;) over all nodes a at time A^ + 1 such that the event occurs as 
the algorithm follows the path to a. 

Lemma 1. Each girl q.s. receives at least ^n^ proposals and at most 2n^ proposals (in- 
cluding redundant ones). 

The statement of this lemma and those below is deliberately somewhat ambiguous. One 
interpretation is that, if g is any particular girl, she q.s. receives the stated number of 
proposals. Another interpretation is that q.s. all n of the girls receive the stated number. 
The second statement is a corollary of the first, because of the nature of 'q.s.'; therefore we 
can prove each lemma using the first (weak) interpretation, but we can apply each lemma 
by using the second (strong) interpretation. 

Proof. Let g be one of the girls, and let Ek be the event that the A;th proposal is to g. This 
immediate event has local probability ^, because each proposal in step B2 is uniformly 
random. Therefore proposals to g are like Bernoulli trials with parameter ^, and the pgf 
for the total number of proposals received by g in the first N levels is simply 

Let r = ^n^. By the first tail inequality, the probability that g receives at most r proposals 
is at most 
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since 1 — x < e ^, and this is superpolynomially small. 

Similarly, if r = 2n^, the second tail inequality tells us that g receives r or more 
proposals with probability at most 

<2-Ml + -j < 2-^62^ 

since 1 + x < e^, again superpolynomially small. © 

Let us say that a boy begins a run of proposals when he becomes the proposer p 
in step Bl or B3; his run ends when one of his subsequent proposals is first accepted in 
step B3. In terms of the branching process, a run continues when a transition is from 
node a to a "rejected" node of the form a^, and it ends at a transition from a to an 
"accepted" node of the form cu^. 

Lemma 2. Each boy q.s. begins at most 2n^ runs of proposals. 

Proof. Let b be one of the boys. His first run of proposals begins just after I increases to b 
in step Bl; his subsequent runs occur just after p is set to Xg = b in step Bl or to Xh = b 
in step B3. Thus, at most two of his runs begin immediately after p becomes b in step Bl. 

The other runs occur when p becomes 6 = in step B3; and this can happen only if 
h is the girl who accepted b at the end of his previous run. Let Et be the immediate event 
that the proposal at time t is to the girl who has most recently accepted b, or to girl 1 if 
b has never yet been accepted. Then the number of runs begun by b is at most 2 plus the 
number of occurrences of Ef] in other words, b can begin r or more runs only if Ef occurs 
r — 2 or more times. But the local probability of -Et is ^, so again we have the binomial 
Pgf 

for the distribution of occurrences of Ef. 

We now complete the proof as in Lemma 1, by setting r = 2n^; the probability of r 
or more runs is at most x'^~'^P{x) for all a; > 1. And we have seen that this bound is 
superpolynomially small when x = 2. © 

Lemma 3. Each run q.s. contains at most n''(logn)^ non-redundant proposals. 

Proof We will prove that for any fixed time t, 1 < t < N, a run starting at t q.s. has the 
stated property. Let a be any node at level t, and let P{a, m, t) be the local probability 
that the proposals immediately following a will include at least m rejected non-redundant 
proposals before reaching time A'"+ 1 or before the first acceptance, whichever comes first. 



2-^P(2) = 2-^ 
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Then we have the recursive formulas 

( 1, if m = 0; 

0, if m > and t = 7V + 1; 

y PK,m,t+l) y fe,PK,m- M+l) ^^^^^^.^^ 
^ n ^ (kh + l)n 



P{a, m,t) = < 



According to Lemma 1, we may assume that kh < 2n^ for 1 < /i < n. (The vahdity 
of this assumption is discussed below.) Then it follows by induction on A'" + 1 — t that 

P{a, m,t) < 



2n^ + l 



If we now choose m = [n''(log n)^J , the local probability that there are more than m non- 
redundant proposals in a run starting at a is at most 

Multiplying by Pr(a) and summing over all a on level t gives a total probability of at most 
exp(— i(logn)^ + o(l)), which is superpolynomially small. © 

The previous proof uses a convenient simplification, indicated by the words 'According 
to Lemma 1, we may assume that . . . '. The assumption we are making holds q.s., but 
it is not always true; moreover, it is a probabilistic assertion about time AT + 1. So we 
should be careful that we are not fallaciously using the future to influence probability 
calculations in the past. A rigorous justification can be made by appealing to the principle 
of negligible perturbation: We simply recompute the transition probabilities when the 
assumption kh < is invalid. 

More precisely, if a is any node in the branching process, we let the perturbed tran- 
sition probabilities from a to ct^ and be + l)n and k'f^/{k'f^ + l)n, respectively, 
where 

k'h = min{kh, 2n^) . 

The proof of Lemma 3 is valid for the perturbed branching process, using k'j^ in place of kh 
in the formula for P{a, m, t). Thus, the proof establishes that each run in the perturbed 
branching process q.s. contains at most n'^(logn)^ non-redundant proposals. And this same 
conclusion also holds q.s. in the unperturbed branching process, because the probability of 
its falsity can increase by at most 2Pr(C), where C is the condition that some transition 
probability has been perturbed between the root and level A" -|- 1. Lemma 1 tells us that 
Pr(C) is superpolynomially small, because the path to a node a at level AT -|- 1 involves a 
perturbed transition probability only if some girl in the state represented by a has received 
more than 2n^ proposals before time A" -|- 1. 
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Lemma 4. Each boy q.s. proposes to at most 2n^^(logn)^ girls. 

Proof. Multiply the results of Lemmas 2 and 3. © 

Lemma 5. Each run q.s. contains at most n'^(logn)^ proposals. 

Proof. Let t be a fixed time, 1 < t < N, and let a be any node at level t. A proposal is 
rejected with local probability ^^^^^^ l/n + J2hi^Ap ^h/{kh + 

By the previous lemmas and the principle of negligible perturbation, we can assume 
that \\Ap\\ < 2n^^(logn)^ and kh < 2n^ . Let p = \\Ap\\/n. Then the local probability of a 
run continuing one more step is at most 

, 2n^ 2n^ + p 2n^ + 2n^^-^(logn)^ , 

^ + ^'-^^2;7TT = ^;7TT- = 

(If the assumptions fail and the local probability is actually greater than this number p', 
we can perturb it by artificially decreasing the probability of rejection and increasing the 
probability of acceptance. For example, we can define the transition probabilities from a 
to ct^ and a]^ to be respectively (1 — p')/n and p' /n^ for 1 < /i < n. The perturbed 
algorithm need not behave at all like the original algorithm does; for example, a boy's 
redundant proposals might be accepted with positive probability. The principle of negli- 
gible perturbations requires only that the nodes of the tree remain the same and that the 
transition probabilities be consistent with all assumptions of the proof.) 

Since 5 < |, the local probability of m consecutive redundant or rejected proposals is 
at most 

{pT < (i ^ 



for sufficiently large n. Hence we can complete the proof as in Lemma 3. © 

Notice that the principle of negligible perturbations has made it legitimate for us, in 
this proof, to estimate probabilities of events that start at time t by using assumptions 
that might fail at some future time > t. (Thus, ||^p|| might be < 277-^*^ (log n)^ at the 
beginning of a run but not at the end.) Arguments based on a weaker principle, which 
would require only that the assumptions hold at time t, would be more complicated; we 
would have to argue that cannot grow by more than 1 at each time step, and our 

upper bound would be (p' + m/ (n{2n^ + I)))'" instead of (p')"*. 

Lemma 6. Each boy q.s. makes at most 2n^^{logn)^ proposals. 

Proof. Multiply the results of Lemmas 2 and 5. © 

Lemma 7. Each boy q.s. proposes to a given girl at most log n times. 

Proof Let b be one of the boys and let j be one of the girls. Perturb the process so that 
after b makes n proposals, none of his subsequent proposals has positive probability of being 
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made to j. This perturbation is neghgible, because b q.s. makes fewer than n proposals 
(Lemma 6). 

Furthermore, if b hasn't made n proposals by time N + 1, pretend that he continues 
proposing until he has done it n times. This can only increase the number of proposals he 
makes to j. 

The pgf for the total number of proposals by b to j is then 

n — 1 + z 



n 



because b has amnesia; each of his n proposals is uniform among the girls. The probability 
that he has made more than log n of them to j is therefore at most 

(lnn)-^°s" ^ exp(-(logn)(lnlnn) + O(lnn)) , 

and this is superpolynomially small. © 

Incidentally, we have adopted here the convention of [4, p. 435] that 'log' is used for 
logarithms in contexts where the base is immaterial, while 'In' denotes the special case of 
natural logs. 

Lemma 8. Each girl q.s. receives at least |n'^/logn non-redundant proposals. 

Proof. A girl receives q.s. ^n^ proposals by Lemma 1, but at most logn from any one boy 
by Lemma 7. © 

6. The main theorem. We are almost ready to show that Algorithm B a.s. produces 
0(logn) outputs. (This final result will be "almost sure" but not "quite sure.") But first 
we need to analyze the time of the first output, because steps Bl and B3 change their 
behavior at that time. 

The first output occurs as soon as each of the n girls has received at least one proposal. 
We can prove that this q.s. happens long before time N — [n^~^^ \: 

Lemma 9. Let Nq = [nlnnlnlnnj . Each girl q.s. receives at least one proposal and at 
most Inn (Inlnn)^ proposals during the Erst Nq steps. 

Proof. The pgf for proposals to g satisfies 

/ n - 1 + x\^° 
P{x) = ( j < exp((a; — 1) Innlnlnn + o(l)) 

for all real x. The probability that g receives no proposal is P(0) < exp(— Innlnlnn + 
o(l)); the probability that she receives Inn (Inlnn)^ or more is at most 

2-inn(ininn)2p(^2) < exp (-(In 2) (In n) (In In n)^ + In n In In n + o(l)) . 

Both of these bounds are superpolynomially small. © 
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Lemma 10. Let e be a positive constant. A girl who has received m non-redundant 
proposals will accept at least (1— e) Inm of them, with probability l—0{'m~'^^/'^) as m — > oo. 
She will accept at most (1 + e) Inm of them with probability l — 0{m~'^ ^'^^^ as m — > oo. 
And she will accept at most m/ (Inm)^ of them with probability 1 — 0(exp(— m/ (Inm)^ + 
7m (In Inm)/ (Inm) "^)) as m — > oo. 

Proof. She accepts the kth. with probabihty 1/k, so the pgf for the total number of accep- 
tances is 




(The notation z— = z\/{z — w)\ for factorial powers is discussed in [4, p. 211].) The 
probability that she accepts fewer than (1 — e) Inm is at most 

(l_,)ln(l-e) rri<:-eV2 

(1 _ e)-(i-^)i^^pfl -e) = - < 

^ ^ ^ ' r(l-e)m^ -r(l-e)m^' 

and this is 0(m~^^/^) because m- = m,^ + 0{m,^~^). (See the answer to exercise 9.44 in [4].) 
Similarly, she accepts more than (1 + e) Inn with probability at most 

(1 + e)-(i+^) '"^P(l + e) < - 0(m-^ /6) . 

r(l + e) m— 

The probability that she accepts more than mo = m/(lnm)^ of them is at most 

mQ '^°P(mo) = exp(— mo In mo + lnr(m + mo) — Inr(mo) — Inm!) 
= exp(— mo(lnm — 6 In Inm + 0(1))) 

by Stirling's approximation. © 

Theorem. Assume that n girls and n boys have independent random preference rankings, 
and let G be one of the girls. Let c be a constant < | and let C be a constant > 1. Then 
G a.s. has at least clnn and at most Clnn stable husbands. 

Proof. The stable husbands of G are output by Algorithm A, which is equivalent to Algo- 
rithm B. The number of outputs is the number of times g accepts a proposal in Algorithm B, 
minus the number of times she accepts a proposal before the first output. 

We have shown in Lemma 8 that g will q.s. receive at least |n^/logn non-redundant 
proposals, among the first n^'^^ proposals made by Algorithm B, if 5 is any constant 
between and ^. Therefore, by the first estimate of Lemma 10, she will a.s. accept at 
least (1 — e)5lnn — O (log log n) proposals. 
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On the other hand, g receives at most n non-redundant proposals altogether. There- 
fore, by the second estimate of Lemma 10, she will a.s. accept at most (1 + e)lnn of 
them. 

Furthermore, by Lemma 9, the first output q.s. occurs before she has received m = 
Inn (In In n)^ non-redundant proposals. Therefore (by the third estimate of Lemma 10) 
she will accept at most 

m Inn / ^ ^ /logloglogn 

(lnm)3 Inlnn \ \ log log n 

before the first output, with probability 

._0(e.p(-ln..0(!-g^)))..--^. 

So the number of outputs will a.s. exceed clnn and be less than Clnn for all large n, 
if we choose 5 and e so that (1 — e)5 > c and 1 + e < C. @ 

7. Remarks. Inspection of the proof of the theorem shows that the conclusion holds 
with probability 1 — 0{n~'^), where 7 is any constant less than both (1 — 2c)^/2 and 
(C — 1)^/2 — (C — 1)^/6. We cannot improve this estimate to 1 — 0{n~^), because there is 
probability y/lnn/ n that the first proposal to G will come from one of her y/lnn favorite 
boys. In such a case she can have at most Vmn stable husbands, because the first stable 
marriage found by Algorithm A gives every girl her least preferred stable husband. 

Our theorem proves that random preferences a.s. guarantee an unbounded number 
of stable matchings, since every stable husband is part of at least one stable matching. 
Can it be shown that the a.s. lower bound of stable matchings grows faster than this, 
say as 17(logn)^? Pittel [11] has proved that the expected number of stable matchings is 
asymptotically e~^nlnn. However, Pittel's theorem does not prove that a large number of 
matchings will almost surely occur; constructions are known [6] where certain preference 
matrices give rise to at least 2"^"^ stable matchings, and such examples may be common 
enough to account for the relatively high expected value. 
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